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We study the statistical behavior under random sequential renormalization (RSR) of several 
network models including Erdos-Renyi (ER) graphs, scale-free networks, and an annealed model 
(AM) related to ER graphs. In RSR the network is locally coarse grained by choosing at each 
renormalization step a node at random and joining it to all its neighbors. Compared to previous 
(quasi-) parallel renormalization methods [Song et. a/.. Nature (London) 433, 392 (2005)], RSR 
allows a more fine-grained analysis of the renormalization group (RG) flow and unravels new features 
that were not discussed in the previous analyses. In particular, we flnd that all networks exhibit 
a second-order transition in their RG flow. This phase transition is associated with the emergence 
of a giant hub and can be viewed as a new variant of percolation, called agglomerative percolation. 
We claim that this transition exists also in previous graph renormalization schemes and explains 
some of the scaling behavior seen there. For critical trees it happens as N/Nq ^ in the limit of 
large systems (where A^o is the initial size of the graph and N its size at a given RSR step). In 
contrast, it happens at flnite N/Nq in sparse ER graphs and in the annealed model, while it happens 
for N/No ^ 1 on scale- free networks. Critical exponents seem to depend on the type of the graph 
but not on the average degree and obey usual scaling relations for percolation phenomena. For the 
annealed model they agree with the exponents obtained from a mean-field theory. At late times, the 
networks exhibit a star-like structure in agreement with the results of Radicchi et. al. [F. Radicchi 
et. al. Phys. Rev. Lett. 101, 148701 (2008)]. While degree distributions are of main interest 
when regarding the scheme as network renormalization, mass distributions (which are more relevant 
when considering "supernodes" as clusters) are much easier to study using the fast Newman-Ziff 
algorithm for percolation, allowing us to obtain very high statistics. 



I. INTRODUCTION 

Complex networks provide a useful representation for 
complex phenomena in a variety of settings including so- 
cial, biological and technological systems, and have been 
studied extensively in the past decade [T]-[3]. A common 
property of many complex real world networks is the het- 
erogeneity of nodes leading to wide (power law, "scale- 
free") degree distributions [4 . 

For systems embedded in Euclidean space, scale-free 
statistics is often related to the notion of self-similarity. 
In statistical physics and critical phenomena this is usu- 
ally studied by using the renormalization group (RG) 
technique, where degrees of freedom of the system are 
eliminated successively by coarse-graining. The scaling 
behavior of the systems close to the fixed point of the 
RG flow is then examined and systems with similar scal- 
ing behavior are classified into universality classes [5j |6] . 

While renormalization is well defined and extensively 
studied for spatially extended systems (including regular 
lattices and disordered systems), it is not clear whether 
it can be applied to complex networks that have no spa- 
tial structure, where the topology is given only by the 
network itself. Naively one would expect that the "small 
world" property displayed by many real networks [71 [8] 
means that they cannot be embedded in any finite dimen- 
sional space, and thus renormalization schemes should 
be less useful. Nevertheless, a real space renormaliza- 
tion transformation for such networks was introduced 
by Song et al. [9l |T0]. In this scheme, the entire net- 



work is covered in each RG step by a set of boxes, and 
each box is considered as a "supernode" in the next RG 
step. Several complex networks were claimed to have a 
finite self-similar or fractal dimension, i.e. the number 
of boxes needed to cover the network seemed to show 
a power-law relation with the diameter of the box, in 
blatant contradiction to their small world property. Al- 
though this issue was never solved, it was suggested that 
the fractality of real world networks depends on self or- 
ganization in the growth mechanism [11 , assortativity of 
fractal networks [12], and fractality of their underlying 
structure f TSUTT] . 

This confiict between the "small-world" property and 
any fractality of complex networks was avoided by Radic- 
chi et al. [18, 19 by using an RG analysis based on the 
same box covering idea, but studying carefully the RG 
fiow itself, without using any length scale dependence for 
making claims about fractality. 

There are some technical concerns in these previous 
box covering methods for renormalizing networks. First, 
according to the original idea of Hausdorff [20j , the sizes 
of boxes should be individually optimized, whereas in 
the suggested methods all boxes are of equal size. This 
is a particularly severe problem due to the heterogeneous 
connectivity in complex networks that leads also to very 
wide distributions of nodes per box, most of them being 
nearly empty. Secondly, even when boxes of the same 
size are used, the precise placement of boxes strongly 
affects the result, and optimizing their positions is not 
practically feasible. Although the suggested methods in 
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Refs. [SHUl [13] are claimed to overcome this problem, 
their results still depend on the order at which the boxes 
are laid down, making these schemes quasi-sequential. 
In particular, the number of nodes per box decreases 
strongly with the number of boxes already put down. 
Finally, during each RG step the size of the network de- 
creases dramatically, which results in a small number of 
data points in the RG flow. For networks with small- 
world property this is particularly serious, as the diame- 
ter of the networks scales only with log(A^) {N being the 
size of the network). To compensate this, only parts of 
the network have been coarse-grained in Ref. [17] at each 
step of renormalization, which adds more complexity to 
the process and makes the results even more difficult to 
interpret. 

In our previous work ^21^, we suggested a completely 
sequential renormalization scheme for undirected and un- 
weighted graphs, called Random Sequential Renormal- 
ization (RSR). In RSR at each step of renormalization 
one node is chosen randomly, and all nodes within a 
given distance b are replaced by a single super- node. All 
links from the outside to the (removed) neighborhood 
are redirected to the super-node, and the super-node is 
then treated like any other node in the network. The 
parameter b is called the box radius. 

RSR has the advantage that it does not involve any op- 
timum tiling and is very easy to code and understand. It 
avoids the problem of mostly-empty boxes. Furthermore, 
as the network is affected only locally and the decima- 
tion is considerably less at each step of RSR, the whole 
flow generates much more statistics which allows a more 
detailed analysis. 

Another advantage of RSR is that it can be interpreted 
as a cluster growth process, where initially all nodes are 
considered to be clusters of mass one. At each step of 
RSR a randomly chosen cluster grows by agglomerating 
with all its neighboring clusters. Using the fast Monte 
Carlo algorithm for percolation introduced by Newman 
and Ziff (NZ) [22l |23], RSR can be easily implemented 
on networks with millions of nodes. 

In our first paper on RSR [21 , we applied this method 
to critical trees. Their simple structure makes it possi- 
ble to study the renormalization flow analytically, giving 
perfect agreement with results from numerical simula- 
tions. We found three regimes in the evolution of crit- 
ical trees under RSR: (i) An initial regime with small 
fluctuations in the region Nq^'^ <N<No (with A^o be- 
ing the initial size of the network and N its size at a 
given renormalization step); (ii) An intermediate regime 

for A^o^^ ^ N ^ where the network is a fat, short 

tree whose structure is dominated by a giant hub. The 
transition between these two regimes is associated with 
emergence of a power-law degree distribution and is de- 
scribed by crossover functions exhibiting finite-size scal- 
ing; (iii) A third regime extending down to N = 1 where 
the network is a star with a central hub and many leaves. 

The appearance of power-law distributions and scaling 
is associated with a continuous transition, called "ag- 



glomerative percolation" (AP) [2T. In one dimension 
(i.e. graphs consisting of a simple 1-d chain), AP has 
been solved exactly [25l [26] . There it shows non-trivial 
scaling with exponents that depend on the box size [25^ . 
In two dimensions, AP is for triangular lattices in the 
same universality class as ordinary percolation (OP), 
whereas it shows different critical behavior for square lat- 
tices [24 . This is related to the bipartite structure of the 
square lattice as every site on the boundary of any clus- 
ter is on the same sub lattice [27^ . The fact that patently 
non-fractal structures like one and two dimensional lat- 
tices also exhibit scaling under RSR suggests that some 
of the scaling laws previously found in small-world net- 
works are due to agglomerative percolation transition, 
rather than any underlying fractality of most networks. 



In the present paper we study the behavior of sparse 
Erdos Renyi (ER) graphs and of the scale-free model of 
Barabasi and Albert (BA) [4 under RSR. For sparse ER 
graphs under RSR with 6 = 1, we find a continuous per- 
colation transition at finite x = N/Nq. Using finite size 
scaling methods, we show that the corresponding critical 
exponents are consistent with a scaling theory based on 
two independent exponents. Within our error estimates, 
these exponents appear to be independent of the initial 
average degree of the ER graphs. For the BA model the 
transition seems to be pushed to x = 1, which makes it 
more difficult to obtain precise numerical results. 



We also study RSR analytically using a mean-field 
theory based on generating functions. The behavior of 
graphs before the AP transition is consistent with this 
theory. After the transition the theory fails due to large 
fluctuations, as well as due to the effect of loops that 
are negligible before the transition. The predictions of 
the theory are in agreement with our simulations of an 
annealed model. 



We introduce our model and simulation method in 
Sec. II, where we also define the graphs and the sizes 
of the ensembles under study. Section III presents our 
simulation results for ER graphs. We show evidence of a 
continuous percolation transition, find the scaling prop- 
erties and the corresponding critical exponents numeri- 
cally, and show that they obey the common scaling re- 
lations of ordinary percolation. In Sec. IV we develop 
a mean-field theory for the evolution of ER graphs un- 
der RSR, and compare its results with our simulations of 
an annealed model. We discuss the behavior of graphs 
beyond the percolation transition in Sec V. Sections VI 
and VII are devoted to the results of simulations on ER 
graphs with different average degrees as well as RSR with 
larger box sizes. Finally in Sec. VIII we examine BA net- 
works, and we conclude our study in Sec. IX. 
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FIG. 1. (color online) One step of RSR with radius 6=1. 
The randomly chosen target node (yellow/white in print) ab- 
sorbs all its nearest neighbors (red/light gray in print). All 
links to the absorbed nodes (blue/dark gray nodes) are then 
redirected to the target. Alternatively one can view the su- 
pernode as a cluster that grows by eating all its neighboring 
clusters. RSR with any b > 1 can be performed by applying 
the above procedure on the same target b times. 



II. THE MODEL 

A. Random Sequential Renormalization 

Random sequential renormalization, RSR, with radius 
b {b = 1,2,...) is the process of consecutively applying a 
local coarse-graining (one step of RSR), on a given net- 
work Go, which leads to a series of graphs Gt{0 < t < T) 
with strictly decreasing sizes Nf. In the following, t is 
called time. The initial graph at t = has A^o nodes, and 
the final graph at t = T is just a single node. We also 
assign masses to the nodes (initially rui = l;Vz G Go). 
For any undirected and unweighted graph with N nodes 
of masses m^, one step of RSR (as shown in Fig. [T]) 
proceeds as follows: 

(i) Choose randomly one of the nodes in the graph as 
the target. 

(ii) Define the neighborhood JV around the target to 
include all nodes within a distance d < b from the target. 
Distance is measured by the number of links that make 
the shortest path between two nodes. 

(iii) Delete all the nodes in A/", except for the target. 

(iv) Delete all the internal links of JV. 

(v) Redirect to the target all links that connect nodes in 
A/" to the rest of the network. 

(v) If a multiple link appears, replace it by a single link. 

(vi) Update the mass of the target to m = m^, with 
ieAf. 

Hence, the target node and all its neighbors up to dis- 
tance b in the network are replaced by a super-node. This 
process preserves all the links to the outside but discards 
the internal details in the target's neighborhood - ana- 
log to course-graining in real space renormalization. The 
super-node is then treated like any other node in the net- 
work. We consecutively repeat this procedure until the 
graph is reduced to a single node. Alternatively, one can 
also define RSR such that the target node is chosen with 
probability proportional to its mass [24l |26] or degree. 



but we only discuss the unweighted form here. 

For 6 = 1 the target absorbs only its nearest neigh- 
bors. The easiest way to implement RSR with any b > 1 
is to apply RSR with 6 = 1 on the same target for b 
successive steps. Although this is slightly slower than an 
optimal coding, we use it in our simulations to reduce 
code complexity and potential sources of errors. 

As indicated in Fig. [l] RSR can also be interpreted as a 
cluster growth process on the graph. The target cluster 
is chosen at random and grows by absorbing all clusters 
within distance b of it. Hence the fast Newman-Ziff (NZ) 
algorithm for growth of percolation clusters can be easily 
adapted to this problem, and it makes sense to speak of 
a percolation transition beyond which one of the clusters 
occupies a finite fraction of the nodes. 

B. The graph ensembles under discussion 

We mainly focus on connected Erdos Renyi (ER) 
graphs with average degree (k) slightly larger than 2. 
The ensemble is produced in the following way: For each 
graph size A^o we make several ER graphs with fixed 
size > No and a fixed number of links such that 
the average degree (/c)* = 2, and determine their giant 
component (which contains about 80% of the nodes for 
this value of (/c)*). If the size of the giant component is 
A^o ± 1% - corresponding to A^o = (0.80 ± O.Ol^A"* - we 
add the giant component to the ensemble; otherwise it 
is discarded. Notice that this leads to a slight scatter of 
A'o and of the average degree of the graphs at the start 
of RSR. The latter is ^ {k)o = 2.4. For each N* the en- 
sembles typically contain ^ 10^ networks, and we apply 
several realizations of RSR on each of them. 

We also examine RSR on ER graphs with (/c)* 7^ 2, as 
well as the scale- free model of Barabasi and Albert [4 . In 
each case the ensemble of connected graphs is generated 
in a similar manner. 



C. Algorithms and quantities of interest 

In network studies much attention has focused on the 
statistics of the number of links (degree) of nodes in a 
network. The degree distribution, the maximum degree, 
the average, and higher moments of the distribution are 
also often considered. But keeping track of the degrees 
of all nodes under RSR is time consuming and seriously 
confines the system sizes and statistics of numerical stud- 
ies. In this paper we have performed numerical sim- 
ulations with degree measurements for networks up to 
A'* = 2.4 X 10^ nodes. 

As mentioned previously, the NZ algorithm can be 
adapted to keep track of cluster masses rather than their 
degrees. With the NZ algorithm large network sizes with 
high statistics can be simulated in a reasonable time. We 
have performed RSR with mass analyses on networks up 
to A/"* = 10^ nodes. Unfortunately all our efforts to track 
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FIG. 2. (color online) Time dependence of network size, N, in 
rescaled units. The size decreases monotonically under RSR. 
Data is obtained from ER graphs with {ky — 2 under RSR 
with 6=1. The curves with dashed lines are obtained by 
averaging t values corresponding to fixed N ^ and the curves 
with solid lines are obtained by averaging N for fixed t. The 
magenta (gray in print) solid line shows the mean-field theory 
prediction (see Sec. IV). The two averages differ when mean- 
field theory breaks down due to fluctuations. In the rest of the 
paper we choose N as the independent variable and average 
all other quantities at fixed N . Numbers in the legend show 
the initial size of the ER graph, A/"*, from which the initial 
giant components are obtained. 



the degrees of the nodes using the NZ or other algorithms 
have led to extremely long running times, thus we restrict 
our analysis to degrees of smaller networks and measure 
only masses for larger ones. As far as critical behavior is 
concerned, we show that mass and degree distributions 
lead to similar conclusions. 



D. Averaging over the ensemble 

When discussing ensemble averages, one can use dif- 
ferent quantities as independent control parameters. In 
particular, one can average over RSR trajectories at fixed 
or at fixed t. As shown in Fig. |2| these two ways of 
averaging give different results at late times (and hence 
small A^) , due to large fluctuations in the number of nodes 
eliminated per RSR step in the hub dominant phase. In 
the same figure we also show the result of a mean-field 
theory (MFT) discussed in section IV. During the initial 
stages of the flow, MFT gives an accurate description 
of RSR, but breaks down when different ensembles lead 
to different results. Some RSR flows last much longer 
time than others and since we want to keep the number 
of members in the ensemble more or less fixed to obtain 
each data point, we choose to average at fixed TV (rather 
than t) in the rest of this paper. 
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FIG. 3. (color online) Plot of kme.^/No vs. N/Nq for ER 
graphs with (/c)* = 2 and several initial sizes. Note that the 
direction of the RSR flow is from right to left. While kmax/No 
is close to zero in the mean- field regime, the hub at late times 
absorbs a finite and increasing fraction of the nodes. The 
transition gets sharper with increased system size; Inset: sim- 
ilar behavior for the rescaled maximal cluster mass Mmax/A^o- 
Note that Mmax always increases monotonically under RSR, 
whereas A;max has to finally decrease. Using the Newman- Ziff 
algorithm mass related properties can be measured on much 
larger systems than degree related properties. 



III. SIMULATION RESULTS FOR ER GRAPHS 
WITH INITIAL (k)* - 2 

We focus in detail on the behavior of the giant com- 
ponent of ER graphs with (A:)* = 2 under RSR with 
6 = 1. For these graphs the average degree of the gi- 
ant component is {k)o = 2.4. We find evidence for a 
continuous 'agglomerative percolation' transition in the 
evolution of these networks under RSR. The transition 
is associated with the emergence of a giant hub or the 
percolation of a giant cluster on the network. We study 
scaling properties at this transition and measure the cor- 
responding critical exponents numerically. We show that 
these exponents obey scaling relations associated with 
percolation, although RSR represents a different univer- 
sality class than ordinary percolation, even in the mean- 
field limit. 



A. Evidence for a phase transition 

We begin by studying the behavior of the maximum 
degree /cmax, as a function of N/Nq^ for various initial 
system sizes, as shown in Fig. [3j Note that the direction 
of the renormalization flow - or time^ is from right to left. 
The initial ER graph has a narrow Poisson degree distri- 
bution with no hubs and /cmax/^o is O(l/A^o)- As RSR 
aggregates nodes locally, although higher degree nodes 
appear in the system, /cmax/^o remains small. However, 
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FIG. 4. (color online) The largest degree /cmax and the second 
largest degree, A:max,2, are of comparable size in the mean- field 
regime, but in the hub regime a giant hub takes over and the 
second largest degree shrinks. This behavior is also consistent 
with a continuous percolation transition, and shows that there 
is only one outstanding hub (cluster) in every network. The 
inset shows the same behavior for the largest and the second 
largest mass. The data is obtained from ER graphs with 

(ky = 2. 

as shown in Fig. [sj k^a^/No suddenly at N/Nq ~ 0.7, 
starts to increase more rapidly. This implies the ex- 
istence of at least two regimes in the evolution of ER 
graphs under RSR: First, a no-hub (or mean-field; see 
Sec. IV) regime, where the degree distribution is narrow, 
fluctuations are negligible, and a mean- field theory de- 
scribes the evolution of the system; Second, a hub regime 
where a growing hub exists and our mean-field theory 
breaks down. This is due to large fluctuations as well as 
the effect of loops in the network. Loops are present in 
the networks initially, but they typically are large and 
the graphs are locally tree-like. As RSR proceeds, these 
loops become shorter and the graphs no longer remain 
locally tree-like. As indicated in Fig. [Sj the transition 
between these two regimes becomes sharper on increas- 
ing system size A^o- 

The same behavior can be observed for the mass of 
the larger cluster, Mmax, as shown in the inset of Fig. |3] 
Initially m = 1 for all nodes. Although clusters grow 
under the renormalization flow in the mean- field regime, 
the maximum mass remains 0{1). In the critical region 
a node with the largest mass percolates and separates 
itself from the rest of the distribution both in terms of 
size and degree. 

This is also indicated in Fig. [4j where both /cmax and 
the second largest degree /Cmax,2 are plotted vs. N/Nq. 
While the two largest degrees are about the same size 
in the mean- field regime, after the transition the largest 
hub grows and the second largest degree shrinks, which 
is another indication of a percolation transition. Similar 
behavior for M^ax and the second largest mass, Mmax,2, 




FIG. 5. (color online) Cluster mass distribution at different 
stages of the RSR flow for ER graphs of N* = 10^ nodes, 
and (k)* = 2. This distribution broadens and approaches a 
power-law pm ^ as x = N/Nq decreases. The power- 

law is broadest at x*(No) = 0.645, for this system size. For 
No (X), the critical point converges to x* ^ Xc = 0.688 
(the green / dashed-dotted curve). For x < x* a giant clus- 
ter emerges and a gap expands between this cluster and the 
rest of the distribution. Note that the size distribution of the 
giant cluster has a shoulder on the right (unlike ordinary per- 
colation). This is due to the possibility of selecting the hub 
as a target node and is discussed in more detail in Sec. V. 



is shown in the inset. 

The detailed relation between mass and degree is 
discussed in the Appendix. No singular behavior in 
^max "^s. Mmax appears in the critical region, and this 
smoothness holds statistically for the mass and degree of 
other nodes as well. Thus either variable can be used 
to extract the critical properties of the phase transition. 
Since RSR with mass measurement is much faster using 
the NZ algorithm, we mostly base our discussions on the 
masses of nodes. 



B. Finite size scaling analysis 

In order to analyze the RG flow in the critical region, 
we perform a finite size scaling (FSS) analysis on a num- 
ber of quantities and their distributions. Initially all the 
nodes have mass m = 1. As shown in Fig.jSj the mass dis- 
tribution broadens with the number of RSR steps, until 
a power law distribution Pm ~ emerges in the crit- 
ical region. As the RSR flow continues, an expanding 
gap appears between the giant cluster and the rest of the 
clusters. As shown by the curve for x = N/Nq = 0.576 
(below the transition) , the peak corresponding to the gi- 
ant cluster has a pronounced shoulder on the right. This 
is different from ordinary percolation (OP), where the 
peak is featureless. As discussed in more detail in Sec. V, 
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FIG. 6. (color online) Convergence of the effective critical 
point, x*(A^o), to Xc = 0.688 as the system size increases. In- 
set: the critical point Xc and the exponent are consistent 
with the values Xc = 0.688 and — 1/zy = —0.225, as indicated 
by the slope of the straight line. 
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FIG. 7. (color online) Data collapse using the FSS ansatz in 
Eq. Q for the mass distribution at x*(A^o)- The exponent 
D = 0.6 gives the best data collapse, and r = 2.67 fits the 
power law (see the inset). These values are consistent with 
Eq. (|6|. 



this shoulder results from the giant cluster being chosen 
repeatedly as the target of RSR. These are rare events, 
but they have dramatic effects on the flow. 

Setting X = N/Nq^ the effective critical point for a fi- 
nite system, x^{No), is defined as the value at which the 
system has the broadest power-law in its mass distribu- 
tion. In Fig. |6) we illustrate the convergence of x* as 
the system size increases. The limiting value for infinite 
system size, Xc, is consistent with Xc = 0.688 as shown 
in the inset. 

To proceed further, we make a conventional scaling 



ansatz for the mass distribution of a finite system in 
terms of a homogeneous scaling function ^ 



where 



Pm = m-^g{nN^^'',m/N^) 



(x ^c)/ 



(1) 



(2) 



Note that such an ansatz is never perfect, and all crit- 
ical parameters discussed in the following are obtained 
by compromises to get the best overall data collapses 
for all quantities of interest, and by assuming the scal- 
ing relations between critical exponents implied by the 
FSS ansatz. A summary of all critical exponents, the 
equations defining them and the figures demonstrating 
numerical evidence, are given in Table [Tj 

Our best estimate for the critical point - mainly from 
Fig.[6j but also taking into account the consistency checks 
in subsection C - is 



0.688 ±0.002 



(3) 



The exponent 1/u in Eq. ([T]) , describing the convergence 
of X* to Xc with increase of system size, is determined to 
be 



0.225 ±0.005 



(4) 



The exponent giving the scaling of the maximum mass 
with system size (see Fig. [t]), is 

(5) 



D = 0.60 ±0.01 . 



It is related to the Fisher exponent r by demanding that 
there is 0(1) cluster of size > TV^P and using Eq. ^ ^: 



1±I^ 



r = 



D 



= 2.67 ±0.03 



(6) 



Degree distributions behave similar to the mass distri- 
butions. The initial ER graph has a Poisson degree dis- 
tribution. As RSR proceeds, higher degree nodes appear 
and the degree distribution broadens. At the phase tran- 
sition local hubs join together to make a single hub much 
larger than all others. Just before the giant hub emerges, 
the degree distribution is approximately a power law with 
a power that is consistent with the power r of the mass 
distribution. Afterward, the hub continues to grow, but 
not forever. Figure |8] shows the degree distribution at dif- 
ferent values of X = N/Nq. At x ~ 0.7 the distribution 
resembles a power law, at x ^ 0.6 a bump appears at the 
rightmost end of the distribution. Later, when x < 0.6, 
a growing gap forms between the giant hub and the rest 
of the nodes. This continues until the shrinking system 
size forces the degree of the giant hub to decrease. 



C. Consistency checks 

In this subsection we check for consistency of our sim- 
ulations with the scaling theory based on the FSS ansatz 
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TABLE I. Summary of critical exponents for ER graphs with (/c)* —2 under RSR with h — \. All exponents are obtained by 
best compromise for the data collapses shown in the figures listed in column 5, and by requiring the scaling relations in column 
3 to hold, except for the exponent ol (last line). The critical exponents are clearly different from those of mean- field ordinary 
percolation (column 6) and for mean- field agglomerative percolation (last column). 
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FIG. 8. (color online) Degree distributions for N* — 1.2 x 
10^ at different stages of the RSR flow. The initial, narrow 
distribution gets broader and approaches a power law pk = 
k~^'^ close to the transition. Then a giant hub stands out and 
a gap opens between the hub and the rest of the nodes. 
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FIG. 9. (color online) Scaling plot of Mmax in the critical 
region. The value of Xc = 0.688 in here and the following 
figures is the same as in Fig. |6] and the critical exponents are 
those given in Table I. The inset shows that the exponents i^k 
and Pk for the maximum degree are the same as those for the 
maximum mass within error. 



in Eq. ([T]), by showing data collapses for different quan- 
tities of interest. Notice that the well known scaling re- 
lations between critical exponents [6 follow from Eq. ([T]) 
by considering appropriate limits. 



1. The order parameter 

An order parameter is any property of a system that 
can unravel the singularity at the critical point, which is 
non-zero only on one side of the transition. Typically Poo 7 
the probability that a given site belongs to the percolat- 
ing cluster, is considered as an order parameter for perco- 
lation. For RSR on graphs both A^max /No and Mrnax/A^o, 
can be used as order parameters. Notice that the latter 
is equal to Poo- 

An ESS ansatz for M^ax follows by multiplying Eq. ([T]) 
by m^, integrating over m, and taking the limit y ^ 00. 



Using also Eq. d6| gives 



^max 7 / iy-rl/v\ 

= HnNo' ) . 



(7) 



Assume now that h{z) satisfies a power law, h{z) ^ 
for z 0. Equation ^ gives then in the supercritical 
case n < (where we expect M^ax Nq) 



D = 1 



v 



and 



M„ 



No 



(8) 



(9) 



Figure [9] shows a data collapse according to Eq. ([7|), 
with Xc = 0.688 and critical exponents as given in Table 1. 
The analogous FSS ansatz for /cmax, 

^^ = MnNy'^^), (10) 
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FIG. 10. (color online) Data collapse for the second largest 
mass and the second largest degree. Again v is taken from 
Table I, while a is fitted for optimal collapse. The value of ak 
used in the inset is equal to a, which is here a — 0.395. 



FIG. 11. (color online) Scaling plot of the second moment 
of the mass distribution, (m^), for ER graphs, with ^jv — 
0.17 ± 0.03. The inset shows the same plot for {m^)ex- The 
same exponents are obtained. 



with Vk = y and = /3 is shown in the inset of Fig. |9] 
The exponents for maximal mass and degree are equal 
within our errors. 



2. The cutoff scale for the cluster size distribution 

The size of the second large cluster, Mmax,2 (resp. 
^max,2), determines the cutoff for the finite clusters (ex- 
cluding the hub). An FSS ansatz based on Eq. ([T]) gives 



max, 2 



1/(71/ 



h2{nNl''') 



and for the infinite system limit 

The exponent a is related to other exponents by 

0.375 ±0.015 . 



1 



(11) 



(12) 



(13) 



One can write similar equations for /cmax,2- Figure [To] 
shows data collapse plots with I/gu = l/cr^i^ = 0.57 for 
the second largest mass and degree. This leads to 



<j = cT/e = 0.395 



(14) 



This estimate was chosen as it gives the best data col- 
lapse. It is consistent with the value obtained in Eq. (13), 
within error. 
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FIG. 12. (color online) Scaling plot for the variance of the 
maximum degree, with Xc — 0.688, Xfv — 0.225 and a^lv — 
1.53 ±0.02. 



S. Average cluster size 

The average size of the cluster to which a randomly 
chosen node of the original network belongs is equal to 
the second moment of the mass distribution. An FSS 
ansatz for the average cluster size can be written as 



■7/zy 



(15) 



9 



and in the limit of A^o 



oo 



The exponent 7 obeys the scahng relation 
7= (2L)-l)i/ = 0.88 ±0.10 



(16) 



(17) 



Figure [TT] shows the corresponding FSS analysis, with 
j/jy = 0.17 chosen for an optimal data collapse. Within 
errors, this is consistent with the value 7/^^ = 0.20 ±0.20 
obtained in Eq. (17). In the inset we show the second 



moment of the mass distribution excluding the largest 
cluster, {w?)exi with the same scaling exponents. The 
exponent 7/^ for the degree moment is found to be the 
same as that for the mass moment within error (data not 
shown) . 



4. Variance of the maximal cluster size 

The variance of /Cmax and M^ax also diverge at the 
critical point. Because of technical problems we do not 
have precise values of the latter, and we concentrate on 
the variance of /cmax- It should scale as 



Var[k^ 



n: 



(18) 







Figure 12 shows the corresponding scaling plot. In OP 
the standard deviation of the order parameter has the 
same critical exponent as the order parameter itself [28^, 
implying v — ^ = a/2. This is not what we find for 
RSR on ER graphs, if we assume a = a^. To clarify 
this we directly looked at the distribution of /c^ax at x* 
for each of the system sizes shown in Fig. 12 (data not 
shown). The distribution is fiat on the left side, but 
has an approximate power-law tail on the right. The 
fiuct nations in this case grow faster than the average, 
unlike in ordinary percolation. We thus believe that the 
observed violation of the scaling relation is not due to 
a but shows that the relation v — j3 = a/2 is 

violated in AP. 

This stems from the difference in the growth process in 
agglomerative and ordinary percolation. Adding a bond 
(or site) in OP might merge only a few clusters into the 
giant cluster, leading to an additive growth of its size 
(and degree). In contrast, if the hub is chosen as the 
RSR target, it absorbs all its neighboring clusters. This 
leads to multiplicative growth. Thus in RSR we expect 
to see larger relative fiuctuations in the hub size near the 
transition comparing to OP (see also Sec. V). 



IV. MEAN-FIELD THEORY AND AN 
ANNEALED MODEL 

We now approach the problem analytically using a 
mean-field theory (MFT) based on generating func- 
tions ^29^. We will show that the critical exponents for 



mean-field RSR do not agree with the ones for ordinary 
mean-field percolation. 



A. General formalism 

Let Uk be the number of nodes with degree k. The total 
number of nodes in the network is = Uk and the 
probability of picking a node with degree k is pk = n^/N. 
The change of Uk in one step of RSR can be written as 
the sum of a loss term rk associated with eliminating a k- 
degree node and a gain term associated with creating 
one. 



duk 
dt 



Tk^Qk . (19) 



The loss term rk is 



rk 



-Pk 



/ kpk 



■Pk' =-{k^l)pk . (20) 



The first term in the central expression is the probabil- 
ity of targeting a k-degiee node, and the second term is 
the probability that any of the neighbors of the target 
have degree k. Note that the mean-field assumption is to 
ignore any potential correlations between the degrees of 
neighboring nodes. 

In order to obtain an equation for dN/dt one does not 
need to know Qk in detail; one just has to know that ex- 
actly one new node is created, whence qk = 1. Sum- 
ming Eq. (19) over k leads then indeed to 



dN 
~dt 



= -{k) 



(21) 



as expected from the fact that all neighbors of a randomly 
chosen node are eliminated in one RSR step. 

To get Qk^ assume that the target has m neighbors 
with degrees /ci, /c2, .../Cm- The new degree of the target 
will be the number of its second nearest neighbors. If all 
degrees are uncorrelated and the target's neighbors are 
not connected among themselves, 

m 

(22) 



We use generating functions to proceed. The degree 
distribution is generated by 



and Qk by: 



Q{x) = "^QkX^ • 



(23) 



(24) 



The degree distribution of the neighbors of the target is 
proportional to kpk^ thus their remaining degree is gen- 
erated by 



E/c kpk {k) 



(25) 
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FIG. 13. (color online) Comparison between the annealed 
model (AM), mean- field theory (MFT), and ER graphs with 
(fe)* = 2. There is good agreement between theory and data 
in the mean-field regime x > Xc- After the transition the 
effect of loops in ER graphs can no longer be ignored and 
results in smaller (fc) for ER graphs. 



Equation (22) gives then 
' g'{x) 



= G 



G jx) 



(26) 



Using Eqs. ( |19[ ) through ([26| one can write the master 
equation for the generating function of the degree distri- 
bution as 



G\^\+{k-l)G{x)-xG{x) 



(27) 



where a = {k)o — 2 and the subscript zero refers to the 
initial value. 

To test the results of MFT we have simulated RSR 
for an annealed random graph model (AM) in the follow- 
ing way: We start with the degree sequence of the giant 
component of the ER graphs studied in the previous sec- 
tion, remembering for each of the Nq nodes its degree, 
but remove all links. During each RSR step we first pick 
a random target node and read its degree k. Then we 
pick k other random nodes i = this time with 

probabilities proportional to their degrees ki. Einally we 

update the degree of the target to k^ = (^Yli ~ ^ and 

discard the other k nodes. 

dsol) 



Eigure 13 compares Eq. ( |3Q| ) to the simulation results 
of the AM and of the model discussed in the last sec- 
tion starting with ER graphs. In all three cases we used 
(/c)* = 2. Due to loops in the ER graphs, the average de- 
gree of the ER graphs is always less than or equal to that 
of the AM or MFT. Note that ER graphs are locally tree- 
like and the effect of loops can be ignored initially. Thus 
before the transition - in the mean-field regime - there 
is complete agreement between the results of MFT, the 
AM, and the ER graphs. But after the transition, the ef- 
fect of loops as well as fluctuations (which we will discuss 
later), result in a breakdown of the mean- field assump- 
tions and the average degree of the ER graphs no longer 
agrees with the other two cases. 

In the mean-field regime the system size, can be 
found as an implicit function of t by using Eqs. (21) 
and ( 30 ) to get 



t 



N-Nf 



aNo 







In 



• 2N/No 



(31) 



This result is shown in Fig. [2] and is in good agreement 
with simulation results in the mean-field regime. 



C. Divergence of degree fluctuations 



B. The average degree 

The moments of the distribution can be obtained from 
d 



dx 



G{x) 



(28) 



One can check that the time derivative of the zeroth mo- 
ment is zero, i.e. normalization is correct. The time 
derivative of the first moment is given by 



^^^^ = ^ [{k? - 2{k)] . 



dt 



(29) 



Also within MFT, the variance of the degree distri- 
bution diverges at the transition point. For ease of cal- 
culations we switch to factorial moments of the degree 
distribution obtained by consecutive derivatives of the 
generating function. 



{k^i^Y- 



{k(k- 

d 

dx 



m 



1)) 



(32) 



l)...{k- 
G{x) 

Using Eq. (|27|, the time derivative of the second fac 



torial moment is 
d 



dt 



N 



2 \2 



{ky 



+ (fe) - 3 



(33) 



Using Eq. ([21| to convert to the derivative with respect We next define a variable u = {k^_)/{k), and use 



to N (for subtleties in this see [21 ) and integrating gives 

aNo 



{k) 



N 



•2, 



(30) 



Eqs. ([21|, ([29|) and ([30|) to get 

du dN 



dx 



aNo + 2N 



a -\- 2x 



(34) 
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FIG. 14. (color online) FSS analysis of the second moment of 
the degree distribution, (/c^), close to criticality for the AM. 
The values of Xc = 0.718, and 7 = 0.5 used in the plot are 
those obtained from the MFT. While the exponent l/v — 
0.225 =b 0.015, obtained from the FSS data collapse is similar 
to ER graphs, the exponent 7 is different from 7 = 0.88 in 
ER graphs. 



with X = N/Nq. Integrating this equation leads to 

9 a -\- 2x 



; + 2x ' 



where 



(35) 



(36) 



and uq is the initial value of u. Since the average degree 
(k) does not diverge at the transition, the divergence of 
u is the same as the divergence of the variance of the 
degree distribution. The quantity u diverges when the 
denominator of Eq. (35) vanishes, so the critical point is 
at 



^c = ^ = i[c-(Wo-2)] 



(37) 



Equations ([36]), and ( [37| ) result in Xc = 0.718 ... for the 
AM model with {k)o = 2.4 we study here (notice that the 
initial degree distribution is not strictly Poissonian due 
to the restriction to the giant component of the original 
ER graph). Substituting n = {x — Xc)/xc into Eq. (35) 
we get 



{k^) u n ^ with 7 = 1/2 . 
For a finite system, we make the FSS ansatz 



(38) 



(39) 



Figure 14 shows an FSS analysis of (A:^) for the annealed 



model, close to criticality, and for several system sizes. 
The values of x^ = 0.718 and 7 = 0.112/0.225 = 0.5 
used in the FSS analysis are taken from MFT and give 
excellent agreement. 



D. Other critical exponents 



To get all other exponents in MFT we use Eqs. (27) 



and ( 32 ) to find the time derivative of the third factorial 
moment, h = {k^_) 



dh 



(40) 



where the dots stand for terms that are less divergent for 
X ^ Xc. Together with du/dt ^ /N -\- . . . this gives 
near the critical point 



\nh ^ u ^ n' 



-1/2 



,1/n 



1/2 



(41) 



suggesting that the third moment has an essential sin- 
gularity. The latter seems to be contradictory to scaling 
theory, but it really is not, and there exists a consis- 
tent solution showing these features. Assume the scaling 
ansatz 



Pk = k ^/(/c//Ccutoff) 



(42) 



for the degree distribution near the critical point, with 
^cutoff diverging at x ^ Xc- For u to diverge, r must be 
< 3. If T were strictly <C 3, we would have u ^cutoff 



^cutoff' there would be a power relation between 



them: h ~ u'^'^ ^) ^ The only way to obtain u r^hih 

is by having a logarithmic divergence of the sum k'^Pki 



(43) 



In order to have 7 



1/2, one needs furthermore /^cutoff ^ 
Nq^ giving then also Eq. (41). The fact that 
^cutoff diverges faster than a power for x 



Xc means 

that there is no simple scaling theory near the transition 
due to the singularity. 



E. Limiting behavior for (k)* 1 

In the limit (/c)* = 1, the giant components of ER 
networks become trees with {k)o = 2. Since trees remain 
trees during RSR, (k) = 2 during the entire RSR flow. 



consistent with Eq. ( |29| ). On the other hand, (/c^) does 
increase with t. Equation (35) leads to 



2x 



2x 



(44) 



12 




and Eq. (37) gives 



Xn 



>0 



(45) 



This is in contrast to the result of [ST , where we found 
Xc = for critical trees. Indeed, the limit (/c)* 1 of 
the present model is not the model of critical trees that 
was treated in [21 . 

This follows from how the critical trees of [2T1 and 
critical ER graphs are generated. In ER graphs links 
are distributed among nodes completely at random. If a 
node is picked at random, the degrees of all its neighbors 
are distributed according to 



(46) 



and there is no further structure. In contrast, the critical 
trees of [21 are generated by a critical random branching 
process that starts from one particular node and imprints 
on them a rooted structure. Therefore, if a node is picked 
randomly, there are relations that hold seperately for its 
mother and its daughters. While the degree distribution 
for the mother satisfies Eq. (46) with k replaced by /c — 1, 
the degree distribution of the daughters is simply p/^. One 
might think that this subtle difference can be neglected 
in a mean-field approximation, but this is not true: Since 
each RSR step affects three generations of nodes, a con- 
sistent grandmother-mother-daughter relationship has an 
effect on the RSR fiow. But it is not intuitively clear why 
this small difference has such a strong infiuence on the 
threshold for AP. Notice that an even more surprising 
dependence on minor details, leading indeed to a viola- 
tion of universality, is seen also in AP on 2-dimensional 
lattices [24 . 

We did not study the case (/c)* = 1 numerically, be- 
cause the size of the largest component in critical ER 

2 /3 

graphs of size A^o is ~ A^q^ , making it very difficult to 
create large initial connected graphs. 



V. FLUCTUATIONS IN THE HUB PHASE 

If the giant cluster (or hub) is itself a target of RSR, 
the size of the network decreases significantly in that time 
step. This gives rise to large fiuctuations in the size of 
the network. 

Figure [15] shows a scatter plot of t vs. N in rescaled 
units, for an ensemble of ER networks with = 1.2 x 
10^ and (/c)* = 2. The x and y axes in this plot are 
coarse grained into 500 and 200 bins respectively, giving 
100,000 pixels. The color of each pixel represents the 
frequency of this (A", t) pair relative to the frequency of 
the most populated pixel. As one can see, an envelope 
exists corresponding to the largest A' at a given time (and 
biggest time for a given N). There is also a second band 
(of high probability) which corresponds to intermediate 
A". A third but weak band also appears at smaller system 
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FIG. 15. (color online) Scatter plot of t vs. N in rescaled 
units for ER graphs with (A;)* = 2 and A* = 120 x 10^. The 
color map shows the relative frequency of each (A, t) pair 
in the ensemble. The main, intermediate and weak bands 
correspond to realizations where the giant hub has been hit 
zero, one or two times, respectively. The inset shows the 
probability that the giant hub is targeted / times by RSR. 



sizes, which is more difficult to distinguish due to the 
considerable ffuctuations. 

The inset of Fig. [15] shows the fraction of realizations 
in which the giant hub is hit / times, conditioned on 
^max/A'o > O-l- Most oftcu the hub is hit only once, 
and never more than three times. Evidently, the enve- 
lope (the uppermost band) comes from the realizations in 
which the giant hub was not hit at all, the intermediate 
band results from cases where the giant hub is hit once, 
and the third weak band is due to rare cases where the 
giant hub is hit twice. 

The slopes of the main bands are also informative. The 
uppermost band starts with slope in agreement with 



Eq. (21). At final stages, where the structure is star-like 



(as discussed in the following section), the bands have 
slope —1, which means that in most cases a leaf is tar- 
geted and thus one node is removed in one time step. The 
wide range of values for realizations as shown in Fig. 15 
explains why in Fig. [2] averaging over t at fixed A' gave a 
different result than averaging over A" at fixed t. 

The distribution of times T for the networks to reach 
A" = 1 corresponds to the leftmost column in Fig. 15 
This distribution has a shoulder where the uppermost 
band hits the y-axis, (at t/No ^ 0.47) and a peak where 
the second one hits it {t/No ^ 0.38). These distribu- 
tions show perfect data collapses for different system sizes 
(data not shown). For networks with (k)* > 2 the shoul- 
der turns into a second peak which grows and becomes 
the dominant peak on increasing (/c)*. It should disap- 
pear for (/c)* 1. 
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FIG. 16. (color online) The relative maximum degree, k = 
kma^/{N — 1), VS. N/Nq for different system sizes. The plot 
shows that the network is star-like at late times since k ap- 
proaches 1. Inset: Variance of k vs. N/Nq. Although the 
qualitative behavior of graphs under RSR at late times is the 
same as for the (quasi-) parallel renormalization method [18] , 
the quoted exponents are different. 



A. Scaling behavior at late times 

Eventually as the networks shrinks, kmax starts to de- 
crease and at the same time the network topology moves 
towards a star-like structure. 

The relative size of the largest hub hz = kmax/{N — 1) 
is a good measure for the similarity of a graph to a star 
which is a graph whose nodes are at most a distance two 
apart. Figure 16 shows and its variance x as a function 



of the relative system size. As one can see, at late stages 
of RSR is close to one, and thus the network has a 
star-like structure. 

The star-like regime was also observed in previous 
(quasi-) parallel methods used for renormalizing net- 
works [IHl [19] . Comparing our analysis with those stud- 
ies, RSR shows scaling and criticality in the flow at early 
times that was not picked up previously, because the 
renormalization steps in the quasi-parallel method were 
too large and jumped over the agglomerative percolation 
transition. Thus only the scaling at late times was ob- 
served in [ISl [ISj . Although the graphs under RSR look 
qualitatively similar to those obtained with the quasi- 
parallel method at late times, the quoted exponents are 
different (our Fig. 16 should be compared with Fig. 1 in 
Ref. ilS,). 



B. The star regime 

We define A^^ to be the last size of the network one 
step before it collapses into a single node. By definition 



FIG. 17. (color online) Data collapse for the distribution of 
the last sizes. The distribution follows the FSS ansatz in 
Eq. (|47| with Ts = 1.40 ± 0.15 and Ds = 0.25 ± 0.05, except 
for the leftmost points. The reason for their special behavior 
is given in [21j . 



the network has to be a pure star at this point, 
ure 



Fig- 

T7| shows a data collapse for the distribution of A^^ 
for ER graphs of different sizes. It is a broad distribution 
following the scaling ansatz 



1 



(47) 



with Ts = 1.40 ± 0.15 and = 0.25 ± 0.05. 

The exponents and Ds are similar to the ones ob- 
tained for critical trees [21 . This suggests universality in 
the final structure of the graphs, regardless of the start- 
ing structure, as the graph collapses into a single node 
and all original structure is lost. 



VI. STARTING WITH OTHER AVERAGE 
DEGREES 

Up to now we studied the behavior of ER graphs with 
(/c)* = 2. Here we discuss the effect of the initial average 
degree on RSR flow, still considering ER graphs. 

Figure [18] represents /Cmax for ER graphs with different 
values of (/c)*. The figure demonstrates similar critical 
behavior for these networks. For higher initial average 
degree the transition gets sharper and moves to the right, 
i.e. towards earlier times. This is also predicted from 
Eq. (37). For larger (A:)*, Xc approaches 1. Note that 
both (^0 and {k'^)o affect the position of 

Figure [19] shows an FSS analysis of Mmax for ER 
graphs with (/c)* = 4. The critical point Xc = 0.865 ± 
0.010 and the exponents 1/u = 0.215 ± 0.030 and 
= 0.62±0.05 are obtained by finding the best data 
collapse. The value of Xc is in agreement with Eq. (37) 
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FIG. 18. (color online) Rescaled maximum degree, kma^/No 
vs. system size for ER graphs with N* — 30000. The transi- 
tion shifts to the right with increase of (/c)*. 
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FIG. 19. (color online) Scaling of Mmax for ER graphs with 
(k)* = 4. The values Xc = 0.865 ± 0.010, l/i^ = 0.215 ± 0.030 
and 1 — P/v = 0.62 dt 0.05, obtained by finding the best data 
collapse agree with Eq. (371 and the exponents for ER graphs 




0.7 0.8 

N/No 



FIG. 20. (color online) Flow of the order parameter under 
RSR with box radius b — 2 for ER graphs of (/c)* —2 and 
several sizes. The data shows a sharp transition at early times, 
but a clean FSS analysis including precisely locating the crit- 
ical point is not numerically tractable. 



with {ky —2 within error bars. 



tion at early times. Figure 20 shows the order parameter 
as a function of N/Nq for networks of different system 
sizes under RSR with b = 2. Although one can clearly 
see evidence for a phase transition at early times, ex- 
trapolating the critical point for the infinite system with 
precision is not possible. 

One of the main differences between the b = 1 case 
and 6 > 1 is that for larger box radii there is no star 
regime. Once the network has diameter two, it will die 
in the next step with probability one. 

Another point to mention is that with any box size 
larger than one, the possibility to incorporate the hub at 
any step is large. The reason is that RSR with 6 > 1 is 
performed by targeting the same node b successive times. 
Although the target itself is not likely to be the hub, it is 
likely that it is the neighbor of the hub and thus merges 
with it. Hitting the same node again means then hitting 
the hub with high probability. With this argument any 
box size higher than one is similar to a weighted RSR, 
where nodes are being targeted with probability propor- 
tional to their mass or degree. 



and the exponent v and /3 agree with those for (/c)* =2, 
within our error estimates. For even higher average de- 
grees (not shown) the exponents still agree with the ones 
obtained for (/c)* =2, although the error bars are rather 
large. 



VII. RSR WITH LARGER BOX SIZES 

In this section we study RSR with box radius 6 > 1 on 
ER graphs with (/c)* = 2. We see evidence for a transi- 



VIII. SCALE-FREE NETWORKS 

Models with broad or "scale-free" degree distributions 
are often more interesting in view of their application 
to real- world networks. We have studied RSR on the 
Barabasi- Albert model [4 . Figure 21 shows the behavior 
of the maximum degree under RSR. Since these networks 
are scale free at the beginning, the transition is pushed 
all the way to N/Nq = 1. There is perfect data collapse 
after the hubs are well established. 
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FIG. 21. (color online) FSS analysis of kma^/No for the BA 
model. The critical point is pushed towards one, and there is 
a perfect data collapse after the hubs are well established. 
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FIG. 22. (color online) FSS analysis of Mmax and {m^)ex 
for the BA model. The critical point is set at Xc = 1. The 
exponents 1/z/ = 0.18 ±0.02, D = 0.5 ±0.1 and = 0.25 ± 
0.03 are obtained from the data collapse. 



The critical point can also be obtained from Eq. (37). 
The value of (A:^)o - and thus also the value of uq = 
{k'^)o/{k)o ~ diverge for scale-free networks, giving Xc ^ 
1 - l/ul « 1. 

When analyzing mass distributions for renormalized 
scale-free networks, it can be argued that one should not 
give masses m = 1 to all nodes of the initial graph. In- 
stead one might assign to every node a mass equal to its 
degree, as this allows one to consider mass as a proxy for 
the degree of nodes in the simulation of the RG flow. 

This convention is used in Figure |22j which illustrates 
an FSS analysis for the maximum cluster mass, Mmax, 
and the second moment of the mass distribution exclud- 



ing the largest cluster, {nn?)ex^ in BA networks of several 
sizes. Setting the critical point at Xc = 1, we obtained 
l/v = 0.18 ± 0.02, D = 0.5 ± 0.1 and -f/v = 0.25 ± 0.03. 



IX. CONCLUSION 

In this paper we have extended Random Sequen- 
tial Renormalization (RSR) to several networks, namely 
Erdos-Renyi and scale-free networks. In each step of RSR 
only a local part of the network within a fixed distance 
from a randomly chosen node is coarse-grained into one 
node. This is in contrast to (quasi-) parallel RG schemes 
that tile and coarse-grain the whole network in one step - 
which, however, has to be broken up into sequential local 
sub-steps for technical reasons. Apart from simplicity of 
the algorithm, RSR generates considerably larger amount 
of statistics and allows for a more detailed analysis of the 
renormalization flow. RSR can be interpreted as a clus- 
ter growth process where at each step a randomly chosen 
cluster grows at its boundary by agglomerating to all its 
neighboring clusters. Hence, the fast Monte Carlo algo- 
rithm of Newman and Ziff [22l [23] for percolation can be 
used to simulate RSR on networks of up to millions - or 
even billions - of nodes. 

For all the graphs we studied, RSR leads to a continu- 
ous agglomerative percolation transition (AP) where the 
largest cluster (node) outgrows all others both in terms of 
its mass and degree. We found three universality classes 
(critical trees, sparse ER graphs and mean-field AP) for 
evolution of networks under RSR. For sparse ER graphs 
we derived the corresponding critical exponents numer- 
ically and found that the exponents obtained by analy- 
sis of the masses of the clusters are not different from 
the ones obtained by analyzing the degrees of the nodes. 
Since mass analysis can be performed much faster with 
the help of the NZ algorithm, we suggest that mass anal- 
ysis may be better suited to extracting scaling properties 
of large networks. Regardless of the initial average degree 
of the ER graph, we found the same critical exponents for 
the percolation transition, within error. At late stages of 
RSR, graphs experience a regime in which they switch to 
a star structure for 6 = 1. For both ER graphs and criti- 
cal trees this regime extends in the range 1 < < N^^. 

For scale-free networks the transition is forced to Xc = 
1. Hence our data collapse methods for finding the crit- 
ical exponents of scale-free networks are not as neat as 
for ER graphs, and this makes it hard to decide whether 
BA and ER networks are in the same universality class. 

While the scaling behavior of critical trees under 
RSR is similar to graph behavior under the (quasi- 
) parallel renormalization scheme studied by Radicchi et. 

[m Us], the percolation transition revealed by our 
method in the early stages of the RG flow is not seen in 
their analysis. We conjecture that it exists also there in 
principle, but it would be very hard to study due to the 
coarseness of their RG flow observation. At final stages 
RSR and parallel schemes lead to the same qualitative 
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picture, namely a star-like structure for 6 = 1, but the 
scaling behavior and the corresponding exponents are dif- 
ferent. 

The simplicity of RSR as well as the fact that it is a 
percolation process both for networks and lattices makes 
it a useful tool for studying complex networks. For real- 
world networks finite-size scaling analysis is not generally 
possible since every network has a fixed (finite) size. But 
even in that case high statistics of RSR fiow and the 
efficiency of the algorithm make it possible to study the 
scaling properties of individual large networks. 
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Appendix: relationship between mass and degree of 
nodes 



Figure [23| depicts the linear relation between /Cmax and 
^max in the critical region for ER graphs with (/c)* = 2 
and several system sizes. This shows that either of them 
can be used to extract renormalization ffow properties 



near the transition. Since RSR can be simulated much 
faster if we only measure the mass-related quantities (in- 
stead of degree), we suggest that the RG analysis in the 
critical region can be confined to mass-related quantities. 
In the final stages of the fiow {N/Nq < 0.3), /Cmax de- 
creases as it cannot exceed the number of nodes present 
in the system. M^ax, on the other hand, increases mono- 
tonically till the end of the process where M^ax = ^o- 

The correlation between mass and degree of clusters 



close to criticality is shown in Fig. 24 for ER graphs of 
N* = 1.2 X 10^ at N/No = 0.688. Each point in the 
scatter plot shows one (m, k) pair in the whole network. 
Also shown are the average degree {k)m of nodes of a 
given mass and the average mass {m)k of clusters with a 
given degree. In the inset we show the number of clusters 
with a given mass, and the number of nodes with a 
given degree, Wk- 

The average mass of nodes with degree one is close to 
1, which means that most of them have not been hit by 
RSR, and the average degree of clusters with mass one 
is more than two which means that the nodes that have 
not been hit by RSR keep their starting average degree. 
Since the two averages differ only for masses (degrees) 
less that ten, we suggest that one can use either of them 
to extract the properties of the percolation transition. 



[1] R. Pastor- S at orr as and A. Vespignani, Phys. Rev. E 63, 
066117 (2001) 

M. J. E. Newman, SIAM Rev. 45, 167 (2003) 
S. Boccaletti, V. Latora, Y. Moreno, M. Chavez, and 
D. U. Hwang, Phys. Rep. 424, 175 (2006) 
A.-L. Barabasi and R. Albert, Science 286, 509 (1999) 
H. E. Stanley, Introduction to Phase Transitions and 
Critical Phenomena (Oxford University Press, Oxford, 
1971) 

D. Stauffer and A. Aharony, Introduction to Percolation 
Theory, 2nd ed. (Taylor k Francis, 1994) 
S. Milgram, Psychology Today 1, 61 (1967) 
D. J. Watts and S. H. Strogatz, Nature 393, 440 (1998) 
C. Song, S. Havlin, and H. A. Makse, Nature 433, 392 
(2005) 
C. Song, L. 
Stat. Mech., 

C. Song, S. 
275 (2006) 

S.-H. Yook, F. Radicchi, and H. Meyer- Ortmanns, Phys. 
Rev. E 72, 045105(R) (2005) 

J. S. Kim, K. L Goh, G. Salvi, E. Oh, B. Kahng, and 

D. Kim, Phys. Rev. E 75, 016110 (2007) 
J. S. Kim, K. I. Goh, B. Kahng, and D. Kim, New J. 
Phys. 9, 177 (2007) 

J. S. Kim, K. 1. Goh, B. Kahng, and D. Kim, Chaos 17, 



K. Gallos, S. Havlin, and H. A. Makse, J. 
P03006(2006) 

Havlin, and H. A. Makse, Nature Physics 2, 



[16 

[17: 
[is: 

[19 

[2o: 

[21 

[22: 

[23: 

[24 
[25: 

[26: 

[27 
[28 
[29^ 



026116 (2007) 

K.T. Goh, G. Salvi, B. Kahng, and D. Kim, Phys. Rev. 
Lett. 96, 018701 (2006) 

H. D. Rozenfeld, C. Song, and H. A. Makse, Phys. Rev. 
Lett. 104, 025701 (2010) 

F. Radicchi, J. J. Ramasco, A. Barrat, and S. Fortunato, 
Phys. Rev. Lett 101, 148701 (2008) 

F. Radicchi, A. Barrat, S. Fortunato, and J. J. Ramasco, 
Phys. Rev. E 79, 026104 (2009) 

K. J. Falconer, The Geometry of Fractal Sets (Cambridge 
Univ. Press, 1985) 

G. Bizhani, V. Sood, M. Paczuski, and P. Grassberger, 
Phys. Rev. E 83, 036110 (2011) 

M. E. J. Newman and R. M. Ziff, Phys. Rev. Lett. 85, 
4104 (2000) 

M. E. J. Newman and R. M. Ziff, Phys. Rev. E 64, 016706 
(2001) 

C. Christensen, G. Bizhani, S.-W. Son, M. Paczuski, and 
P. Grassberger, arXiv:1012.1070-to appear in EPL(2011) 
S.-W. Son, G. Bizhani, C. Christensen, P. Grassberger, 
and M. Paczuski, EPL 95, 58007 (2011) 
S.-W. Son, C. Christensen, G. Bizhani, P. Grassberger, 
and M. Paczuski, Phys. Rev. E 84, 040102(R) (2011) 

H. W. Lau, et al. in preparation 

M. Z. Bazant, Phys. Rev. E 62, 1660 (2000) 

M. E. J. Newman, S. H. Strogatz, and D. J. Watts, Phys. 

Rev. E 64, 026118 (2001) 



17 




FIG. 23. (color online) Log- log plot of normalized maximum 
degree /cmax/A^o vs. Mmax/ATo for ER graphs with (/c)* =2 
and size iV* = 1.2 X 10^ These are proportional to each 
other in a region close to criticality which extends over larger 
domains with increase of system size. While Mmax increases 
with A^o monotonically, /cmax is confined to the current system 
size N and starts to decrease deep in the hub phase. The inset 
shows the ratio of maximal mass and degree on a linear scale. 
The curve is linear in the critical region. 




FIG. 24. (color online) Scatter plot of masses and degrees of 
all clusters close to criticality. The data is obtained from one 
RSR trajectory of an ER graph of iV* = 1.2 x 10^ nodes, with 
{ky = 2 at N/No = 0.688. The red hne with circles shows 
the average degree of clusters of a given mass, and the green 
line with squares shows the average masses of nodes with a 
given degree. The inset shows the number of clusters of a 
given mass (wm) and the number of nodes of a given degree 



